Cell Probe Lower Bounds and Approximations for Range Mode

Authors

  • Mark Greve
  • Allan Grønlund Jørgensen
  • Kasper Green Larsen
  • Jakob Truelsen
Abstract

The mode of a multiset of labels is a label that occurs at least as often as any other label. The input to the range mode problem is an array A of size n. A range query [i, j] must return the mode of the subarray A[i], A[i+1], ..., A[j]. We prove that any data structure that uses S memory cells of w bits needs Ω(log n / log(Sw/n)) time to answer a range mode query. Secondly, we consider the related range k-frequency problem. The input to this problem is an array A of size n, and a query [i, j] must return whether there exists a label that occurs precisely k times in the subarray A[i], A[i+1], ..., A[j]. We show that for any constant k > 1, this problem is equivalent to 2D orthogonal rectangle stabbing, and that for k = 1 it is no harder than four-sided 3D orthogonal range emptiness. Finally, we consider approximate range mode queries. A c-approximate range mode query must return a label whose number of occurrences is at least 1/c times that of the mode. We describe a linear-space data structure that supports 3-approximate range mode queries in constant time, and a data structure that uses O(n/ε) space and supports (1 + ε)-approximate range mode queries in O(log(1/ε)) time.
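The three query types defined in the abstract can be illustrated with a brute-force sketch. This is not the paper's data structure: each query here scans the subarray in time linear in its length, and it only makes the definitions concrete. The function names, the 0-indexed inclusive ranges, and the assumption that i <= j are choices made for this illustration.

```python
from collections import Counter


def range_counts(A, i, j):
    """Label frequencies in A[i..j] (0-indexed, inclusive; assumes i <= j)."""
    return Counter(A[i:j + 1])


def range_mode(A, i, j):
    """Return (label, frequency) for a mode of A[i..j].

    If several labels tie for the highest frequency, any one of them
    is a valid mode; this sketch returns the first one encountered.
    """
    counts = range_counts(A, i, j)
    return max(counts.items(), key=lambda kv: kv[1])


def range_k_frequency(A, i, j, k):
    """True if some label occurs exactly k times in A[i..j]."""
    return k in range_counts(A, i, j).values()


def is_c_approximate_mode(A, i, j, label, c):
    """True if `label` occurs at least 1/c times as often as the mode of A[i..j]."""
    counts = range_counts(A, i, j)
    return counts[label] * c >= max(counts.values())
```

For example, with A = [1, 2, 2, 3, 2, 3, 3, 3], the range [0, 4] contains label 2 three times and labels 1 and 3 once each, so its mode is 2, and it has labels of frequency 1 and 3 but none of frequency 2.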

Similar resources

Bounds for CDFs of Order Statistics Arising from INID Random Variables

In recent decades, studying order statistics arising from independent and not necessarily identically distributed (INID) random variables has been a main concern for researchers. Computing the cumulative distribution function (CDF) of these random variables (Fi:n) is complex, time-consuming, and software-intensive. Therefore, obtaining approximations a...

Certificates in Data Structures

We study certificates in static data structures. In the cell-probe model, certificates are the cell probes which can uniquely identify the answer to the query. As a natural notion of nondeterministic cell probes, lower bounds for certificates in data structures immediately imply deterministic cell-probe lower bounds. In spite of this extra power brought by nondeterminism, we prove that two wide...

A Survey of Communication Complexity for Proving Lower Bound of Data Structures in Cell-Probe Model

Proving lower bounds for computational problems is always challenging work. In this survey, we will present some techniques for proving lower bounds for data structures in the cell-probe model. There is a natural relationship between the cell-probe model and communication complexity, so many proofs of lower bounds in the cell-probe model are related to communication complexity. In communication complexity, th...

Models and Techniques for Proving Data Structure Lower Bounds

In this dissertation, we present a number of new techniques and tools for proving lower bounds on the operational time of data structures. These techniques provide new lines of attack for proving lower bounds in both the cell probe model, the group model, the pointer machine model and the I/O-model. In all cases, we push the frontiers further by proving lower bounds higher than what could possi...

Asymmetric Communication Complexity and Data Structure Lower Bounds

You can think of the cell probe model as having a CPU and some random access memory (RAM). The state of your data structure is encoded in the memory cells of the RAM and the CPU must answer some query by accessing certain memory cells. For the rest of the lecture s will denote the number of memory cells and w will denote the number of bits in each memory cell. The only thing you are charged for...

Publication date: 2010